RAID CONSISTENCY INITIALIZATION METHOD 
DESCRIPTION 



BACKGROUND OF THE INVENTION 
[Para 1 ] 1 .Field of the Invention 

[Para 2] The invention relates to a redundant array of independent disks 
(RAID) consistency initialization method. 

[Para 3] 2. Description of the Prior Art 

[Para 4] In a computer system, a redundant array of independent disks 
subsystem (RAID subsystem) is used to provide data storage space having high 
availability, high performance, or high volume to the host entity. 

[Para 5] A prior art RAID subsystem comprises a RAID controller, and a RAID 
composed of a plurality of hard disc drives (HDD) electrically connected to the 
RAID controller. Furthermore, the RAID controller defines the RAID as one or 
more logical disk drives of RAID-level 1 , 3, or 5. 

[Para 6] A RAID subsystem has to perform the process of RAID creation to 
define the RAID itself, and then is presented to the host entity. At this time, 
the RAID is available to the host entity or on line so that the host entity is 
capable of accessing the RAID. When the RAID is not available to the host 
entity or not on line, the host entity does not know the existence of the RAID 
and can not access the RAID. 

[Para 7] The above-mentioned RAID creation is shown in Fig.lA. When RAID 
creation is started (1 00), RAID configuration is firstly set (1 1 0), and then a 
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consistency initialization is performed (1 50). After tPie consistency initialization 
is completed, the RAID configuration is written to the member disks (1 30). 
Meanwhile, the RAID creation Is regarded as being completed (140), and the 
RAID is then presented to the host entity (1 80) so that the RAID is available to 
the host entity. The operation of the consistency initialization is to make user 
data and check data consistent. The consistency initialization needs to be 
performed because only when the member disks had been performed with 
consistency initialization, the data stored therein can be regenerated while the 
member disk is out of order. Please refer to Fig.l B, which is a flow chart of a 
consistency initialization process according to the prior art. The consistency 
initialization is executed by performing initialization block-by-block from 
block 0 to the last block. That is, the consistency initialization is executed by 
performing initialization on block 0 firstly, and then on block 1 after block 0 is 
completely executed, and the consistency initialization is executed on the 
blocks one after another until the last block is completely executed. 

[Para 8] In the prior art method of RAID creation and the consistency 
initialization, several problems are illustrated as follows. First, the consistency 
initialization spends a lot of time, such as a few hours, or even a few days. 
However, as mentioned above, before the consistency initialization is 
completed and the RAID is presented to the host entity, the RAID is not 
available to the host entity or not on line so that the RAID can not be accessed. 
In other words, it takes the users at least several hours to wait until the RAID 
subsystem can be used since the execution of the RAID creation. This makes 
users/customers very inconvenient. 

[Para 9] Therefore, how to complete the RAID creation fast and present the 
RAID subsystem to the host entity or allow the RAID to be accessed by I/O as 
soon as possible becomes a serious problem to be solved. 
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SUMMARY OF THE INVENTION 

[Para 1 0] It is therefore a primary objective of the claimed invention to 
provide a method which can quickly performs the RAID creation and make the 
RAID allow I/O accessing, to solve the above-mentioned problem which makes 
users wait for the completion of the consistency initialization. 

[Para 1 1] According to the claimed invention, consumed time of waiting the 
completion of the consistency initialization of a RAID subsystem is eliminated, 
and simultaneously, the initialization can be performed on the data area to 
erase data, which exists previously, on the data area in order to prevent 
unexpected behavior of the host entity caused by the remained data. 

[Para 12] According to an embodiment of the present invention, the method 
for redundant array of independent disks (RAID) consistency initialization 
comprises: creating RAID, including setting a RAID configuration of the RAID 
and creating an initialization progress table for storing progress states of the 
initialization of the RAID, wherein after the initialization progress table is 
created and before the completion of the consistency initialization is 
completed, the RAID is allowed to access and the consistency initialization is 
allowed to start. 

[Para 1 3] According to another embodiment of the present invention, if a 
member disk failed and a new member disk is used to perform a rebuilding of 
the RAID before the completion of the consistency initialization, the rebuilding 
only has to perform on the regions which have been completed with the 
consistency initialization and the rebuilding on the regions which have not 
been completed with the consistency initialization can be performed by the 
consistency initialization. 
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[Para 14] According to still another embodiment of the present invention, 
when an I/O operation accessing the RAID is a read operation and a region on 
the RAID to be accessed by the I/O has not been Initialized yet, no consistency 
initialization is performed on the region, and a value of zero or a 
predetermined value will be returned directly. 

[Para 1 5] These and other objectives of the claimed invention will no doubt 
become obvious to those of ordinary skill in the art after reading the following 
detailed description of the preferred embodiment that is illustrated in the 
various figures and drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[Para 16] Fig.lA illustrates a RAID creation according to the prior art; 

[Para 1 7] Fig.1 B illustrates a process of a consistency initialization according 
to the prior art; 

[Para 1 8] Fig.2A is a diagram of an embodiment of a RAID consistency 
initialization method according to the present invention; 

[Para 1 9] Fig.2B is a diagram of another embodiment of a RAID consistency 
initialization method according to the present invention; 

[Para 20] Fig.2C is a diagram of still another embodiment of a RAID 
consistency initialization method according to the present invention; 

[Para 21] Fig. 3 is a block diagram of a RAID configuration setting of an 
embodiment according to the present invention; 

[Para 22] Fig. 4 is a diagram of a RAID of an embodiment according to the 
present invention; 

[Para 23] Fig.SA is a flow chart of an embodiment of creating an initialization 
progress table according to the present invention; 
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[Para 24] Fig.SB and Fig.SC illustrate an embodiment of defining initialization 
regions of the RAID shown in Fig. 4 by the method shown in Fig. 5 according to 
the present invention; 

[Para 25] Fig.6A is an embodiment of a consecutive consistency initialization 
according to the present invention; 

[Para 26] Fig.6B is another embodiment of a consecutive consistency 
initialization according to the present invention; 

[Para 27] Fig.7A is an embodiment of a regional initialization priority 
adjustment mechanism according to the present invention; 

[Para 28] Fig.7B is another embodiment of a regional initialization priority 
adjustment mechanism according to the present invention; 

[Para 29] Fig. 8 is an embodiment of an induced consistency initialization 
according to the present invention; 

[Para 30] Fig.9A and Fig.9B are another embodiment of an induced 
consistency initialization according to the present invention; 

[Para 31] Fig. 10 is a diagram of restarting a RAID subsystem when the RAID 
creation is completed but the consistency initialization has not been 
completed yet according to the present invention; 

[Para 32] Fig.l 1 is a diagram of loading an initialization progress table when 
the RAID subsystem is restarted according to the present invention; and, 

[Para 33] Fig. 1 2 is a diagram of an embodiment of identifying a correct 
initialization progress table when the initialization progress table is loaded in 
Fig.l 1 according to the present invention. 



DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 

[Para 34] The present invention provides a RAID consistency initialization 
method. The term "RAID (redundant array of independent disks)" used in the 
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present invention broadly includes various RAID levels, such as NRAID, RAIDO, 
1,2,3, 4, 5, 6 and the combination thereof. 

[Para 35] A RAID subsystem of the present invention comprises one or more 
RAID controllers, and one or more RAID arrays connected to the RAID 
controllers. The RAID controllers define the RAID array as one or more logical 
medium units (LMU) of one or various RAID levels and/or the combination of 
the RAID levels. This improves the data availability, the performance of data 
access, or increases the data storage volume. The RAID includes a plurality of 
physical storage devices (PSD), such as hard drives or other physical devices, 
for storing data. The possible RAID levels can be NRAID or RAID 0, 1 , 2, 3, 4, 
5, 6, etc. 

[Para 36] In other words, a RAID subsystem can divides a plurality of PSDs 
into one or more LMUs of various RAID levels to make the RAID on-line, or to 
present the LMU or LMUs to the host entity and thus available to the host 
entity so that the host entity can access the RAID subsystem. The host entity of 
the RAID controller can be a host computer or another RAID controller, 
depending on the configuration of the RAID subsystem. Furthermore, the RAID 
controllers can be configured into a redundant RAID controller pair to improve 
the availability of the RAID subsystem. 

[Para 37] Before a RAID is presented to the host entity or is on-line to accept 
I/O access, the RAID creation has to be completed first. According to an 
embodiment of the present invention, , the RAID subsystem creation can start 
by commanding the RAID controllers. For example, a keyboard can be used to 
enter the commands to the RAID controller through RS232 port, or control 
buttons on the chassis of the RAID subsystem can be used to enter the 
commands to the RAID controller. 
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[Para 38] Please refer to an embodiment of a RAID consistency initialization 
method according to the present invention shown in Fig.2A. In this 
embodiment, when the RAID controller receives the CREATE RAID command, it 
first sets the RAID configuration (21 OA) and then creates an Initialization 
progress table In a memory (220A) In order to store the progress states of the 
initialization. Then, the initialization progress table and the RAID configuration 
are written into a non-volatile disk (230A). RAID creation is therefore 
completed (240A). At this time, the consistency Initialization (250A) can be 
started. 

[Para 39] The consistency Initialization (250A) comprises two parts: 
consecutive consistency Initialization (260A) and Induced consistency 
initialization (290A). Consecutive consistency initialization is the consistency 
initialization that can be started to consecutively proceed in the background 
immediately after the RAID creation Is completed or on an appropriate 
opportunity after the RAID creation Is completed, such as when the RAID Is 
presented to the host or after host I/O accesses the RAID. Induced consistency 
initialization is the consistency initialization that is induced by the access of an 
I/O to the RAID on a region which is not initialized, after the RAID creation is 
completed (280A). Because the RAID creation Is very fast, for example, several 
seconds to several minutes, there's no need to wait since entering the RAID 
creation command till the completion of RAID creation. In which the prior art 
problems are completely avoided. The consistency initialization does not affect 
the access of the host entity to the RAID subsystem because it is performed in 
the background. Each RAID can be partitioned as one or more LMUs, and when 
the RAID creation Is completed, these LMUs can be presented to the host 
entity. The present Invention method Is not affected by the partition. In the 
present embodiment, although the RAID configuration Is written Into member 
disks (230A), it can be written into other types of non-volatile memory device. 
Please note that the aforementioned non-volatile memory device can be a 
member disk, a disk drive other than the member disk, a battery backed-up 
SRAM, or a flash RAM. In conclusion, the non-volatile memory device Is a 
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memory device which can maintain the data stored therein when the power of 
the memory device is cut off. 

[Para 40] Please refer to another embodiment of a I^ID consistency 
initialization method according to the present invention shown in Fig.2B. In 
this embodiment, when the RAID controller receives the CREATE RAID 
command (200B), it first sets the RAID configuration (21 OB) and then creates 
an initialization progress table in a memory (220B) in order to store the 
progress states of the initialization. Then, the initialization progress table and 
the RAID configuration are written into member disks (230B). RAID creation is 
therefore completed (240B). At this time, the consistency initialization (2 SOB) 
can be started. In the present embodiment, after the RAID creation is 
completed, the RAID is presented to the host entity (280B) first and then the 
induced consistency initialization (290B) is performed . When the host I/O 
accesses an address on the RAID which is not initialized, an induced 
consistency initialization occurred. Because the RAID creation is very fast, for 
example, several seconds to several minutes, there's no need to wait since 
entering the RAID creation command till the completion of RAID creation. Since 
the consistency initialization is performed in the background, it does not affect 
the access of the host entity to the RAID subsystem. In the present 
embodiment, although the RAID configuration is written into member disks 
(230B), it can be written into other types of non-volatile memory device as 
those mentioned in step (230A). 

[Para 41 ] Please refer to still another embodiment of a RAID consistency 
initialization method according to the present invention shown in Fig.2C. In 
this embodiment, after the RAID CREATE is started (200C) and the RAID 
configuration is set (2 IOC), an initialization progress table is created in a 
memory (220C). Then, the initialization progress table and the RAID 
configuration are written into member disks (230C). RAID creation is therefore 
completed (240B). At this time, the consistency initialization (250B) can be 
started. The consistency initialization (250C) comprises two parts: consecutive 
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consistency initialization (260C) and induced consistency initialization (290C). 
In the induced consistency initialization, it is not necessary to consider 
whether the RAID Is presented to the host or not, since the cause Inducing the 
induced consistency Initialization can be a host I/O or an I/O other than a host 
I/O. For example, when the RAID Is storage portion of a large system, the I/O 
accessing the I^ID might be an I/O to store data outside the system or an I/O 
to store data of the large system Itself into the RAID. The large system, e.g., is 
a NAS (Network Attached Storage). 

[Para 42] As shown In the above-mentioned Flg.2A to Flg.2C, the consistency 
initialization can be performed after the RAID creation Is completed and the 
RAID can be accessed by an I/O. 

[Para 43] Please refer to Fig. 3, which Is a block diagram of an embodiment of 
a RAID configuration setting according to the present invention. As shown in 
Fig. 3, the aforementioned RAID configuration may comprise the steps of 
defining the RAID unique ID (310), the RAID level (320), the member disks 
(330), the reserved space (340), the striping size (350), and so on. 

[Para 44] Each member disk has a data space and a reserved space, where the 
data space Is used to store user data and/or check data and the reserved 
space Is reserved for the RAID to store the Information of the RAID 
configuration or other Information. Please note that the reserved space Is not 
used to store user data or check data. 

[Para 45] The RAID unique ID is used for a RAID controller to check whether a 
disk drive belongs to a specific RAID or not. When the RAID subsystem is 
restarted while the power Is resumed after accidentally cut off or due to some 
other unpredlcted situations, the RAID controller will read data from the 
member disks. If the disks are not checked first as to whether It belongs to 
this RAID, a serious error may occur when data in the disks is used. The RAID 
configuration may also comprise write policy setting, (for example, selection of 
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write through mode or write bacl< mode), RAID size setting, or other 
information. Please note that the RAID configuration is stored in the above- 
mentioned reserved space of each member disk. 

[Para 46] Please refer to Fig.4, which is a diagram of an embodiment of a 
RAID according to the present invention. As shown in Fig.4, the RAID 
comprises three member disks, DISKO- DISK2. Furthermore, the RAID is 
partitioned into two LMUs, LMUO and LMUl . The RAID is composed of a 
plurality of stripes, where each stripe comprises a check data chunk (here the 
check data chunk is a parity data chunk and is labeled as P) and two other 
chunks, which are user data chunks (labeled as D). 

[Para 47] In Fig.4, the RAID is a RAID level-5 RAID. Therefore, the data space 
is used to store user data and check data. In an NRAID or RAID level-0 RAID, 
there is no check data, so the data space is used to store user data. In 
addition, in a RAID level-1 RAID, although data stored in two chunks of the 
same stripe are both user data, the stripe can still be regarded as having a 
user data chunk and a check data chunk because the user data of the RAID 
level-1 RAID can be used for checking. Therefore, the data space can be 
regarded as for storing both user data and check data. 

[Para 48] Please refer to Fig.SA, which is a flow chart of an embodiment for 
creating an initialization progress table according to the present invention. As 
shown in Fig.SA, a value N is first determined as the number of regions the 
member disks of the RAID are to be divided into (510), where each region is a 
basic unit for performing consistency initialization. Then an initialization 
progress table is created having N fields to record the initialization state of 
each initialization region (520). After the initialization progress table is written 
to each member disks (530), and the process of initialization progress table 
creation is completed. The above-mentioned basic units are called 
initialization regions and the initialization on each initialization region is called 
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regional initialization. The present invention consistency initialization is 
performed by performing regional initialization on each initialization region. 
When the regional Initialization on all Initialization regions Is completed and 
the completion state Is stored In the reserved space of the member disks, the 
consistency Initialization Is completed. 

[Para 49] Please refer to FIg.SB and FIg.SC, which Illustrate an embodiment of 
defining initialization regions of the RAID shown in Fig. 4 by the method shown 
in Fig.SA of the present invention. As shown in Fig.SB, the data space of the 
member disk of the RAID Is divided Into N Initialization regions of the same 
size, where each Initialization region Includes 3 sections each belonging to a 
different member disks. As shown In FIg.SC, each Initialization region Includes 
4 stripes, so that each section includes 4 chunks. Each of the 4 stripes of the 
same initialization area has 3 chunks, which includes 2 user data chunks D and 
1 check data chunk P, wherein the check data chunk Is a parity data chunk. 
The value stored In the parity data chunk Is the excluslve-OR value of the 
values stored In the user data chunks. The check data chunks can be In the 
same section, or spread in different sections. 

[Para 50] Although in the above-mentioned embodiments, the initialization 
region is larger than a stripe and includes 4 stripes, the size of an initialization 
region, in general. Is not necessarily larger or smaller than that of a stripe. 

[Para 51] The considerations for determination of the above-mentioned N are 
as follows. First, when N is large, the number of initialization regions of the 
RAID is large, and each initialization region Is small. Therefore, more reserved 
space Is needed to store the Initialization progress table, but the time of 
performing regional Initialization on each Initialization region Is less and the 
host I/O can be performed without waiting too long. Oppositely, If N Is small, 
the number of initialization regions of the RAID is small and each initialization 
region is large. Therefore, only a small reserved space Is needed to store the 
initialization progress table, but each Initialization region needs a lot of time 
to be Initialized and the host I/O has to wait for a long while for the 
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completion of the regional initialization. For example, if a storing space of a 
member disk is 256G bytes, and if each initialization region is 256K bytes, the 
N is determined as IM (1*1024*1024). Here, if the initialization progress state 
of each initialization region is indicated by only 1 bit, the initialization 
progress table occupies at least 1 M bits (1 28K bytes). In another example, if 
each initialization region is lOM bytes, and if the RAID has 10 member disks, 
which are connected to the same channel of a channel speed of 200M 
bytes/second (for a typical Fibre channel), the initialization of each 
initialization region needs at least 1 OM bytes*l 0/200M Bytes/second 
=0.5second. That is, the host I/O has to wait at least 0.5 second for the 
initialization, which makes the host I/O too slow. 

[Para 52] In the above-mentioned embodiments, the initialization regions are 
of the same size, but in another embodiment, the initialization regions may be 
of different sizes. 

[Para 53] When in use, the initialization progress table is stored in the 
memory of the RAID controller, and the initialization progress state is always 
updated in the initialization progress table. Furthermore, the initialization 
progress table will be stored in each member disk later. After the RAID 
creation is completed and before the consistency initialization is completed, if 
the RAID subsystem has to be restarted when the power is resumed after 
suddenly cut off or due to some other unexpected reasons, the initialization 
progress tables stored in the member disks can be read into the memory first 
and a correct initialization progress table will be identified through CRC check. 
Therefore, the uncompleted consistency initialization can be continued 
according to the correct initialization progress table. For example, the 
initialization state of each initialization region can be indicated by one bit 
stored in the initialization progress table, where if the initialization region is 
initialized completely, the bit is 1 , otherwise, the bit is 0. That the bit is 0 
means that either the initialization has not been started yet or the initialization 
has been started but yet not completed. 
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[Para 54] When the RAID creation is completed and the defined RAID is 
presented to the host entity as LMU(s) so that the RAID subsystem is available 
to the host entity, the RAID subsystem starts to perform the consistency 
initialization according to a predetermined schedule. This is the above- 
mentioned consecutive consistency initialization, which is performed 
continuously until the consistency initialization is completed. However, when 
the host entity accesses a RAID and a associated RAID controller receives a 
host I/O command and parses the command to access a initialization region 
corresponding to the host I/O command, if the initialization region is not 
initialized, the regional initialization is induced to be performed due to the 
host I/O command. This is the above-mentioned induced consistency 
initialization. 

[Para 55] Because the consecutive consistency initialization (shown in Fig.6A), 
which is performed according to a predetermined schedule, and the induced 
consistency initialization (shown in Fig. 8, Fig.9A, and Fig.9B), which is induced 
by the host I/O, have no direct relation between them, the consecutive 
consistency initialization and the induced consistency initialization are 
performed separately. The only concern is if either one of the consecutive 
consistency initialization and the induced consistency initialization is 
performed on a particular initialization region, the other cannot be performed 
on the same initialization region such that data stored therein will not be 
destroyed. 

[Para 56] Please refer to Fig.6A, which is a diagram of an embodiment of a 
consecutive consistency initialization according to the present invention. First, 
an initialization region is selected to be initialized (605A), and then the 
selected initialization region is tested to determine whether the initialization of 
the selected initialization region has been completed (61 OA). If it is affirmative 
in (61 OA), determining if there is another initialization region which has not 
yet completed initialization (61 5A). If it is affirmative in (61 5A) indicating that 
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there is an initialization region wPiicli has not yet completed with the 
initialization, then step (605A) is performed. On the other hand, if it is 
negative in step (61 5A), which means there is no initialization region which 
has not been initialized yet, waiting for the completion of the initialization of 
all initialization regions. Therefore, the information of the completion of all 
RAID initialization regions is stored into the member disk (620A), and then the 
consistency initialization is completed (690A). If it is detected that the 
initialization on this region has not been completed In step (61 OA), the 
selected initialization region Is tested to see If It Is now being initialized 
(62 5A). If In step (62 5A) It Is affirmative, which means that the selected 
initialization region is now being initialized, the process can wait for the 
completion of the initialization, or the selected initialization region can be 
skipped directly and the step (61 5A) can be performed to determine if there is 
any initialization region of which the initialization is not completed. On the 
contrary, if in the step (62 5A) it is negative, which means that the selected 
initialization region has not been initialized yet, the selected Initialization 
region is initialized immediately (635A), and then the initialization progress 
state stored in the initialization progress table (640A) is updated. The 
initialization progress table Is then stored In the member disks (645A) in a 
suitable time after the completion of the regional Initialization. The suitable 
time can be the timing when the whole regional Initialization process Is 
performed to a certain extent, for example, when a predetermined number of 
initialization regions or a predetermined percentage of the initialization 
regions is completed with the regional initialization, or after a predetermined 
time has elapsed since the initialization progress table was stored In the 
member disk last time. When the step (645A) is completed, the step (61 5A) is 
performed to see if there is any initialization region not completed with the 
initialization, and related procedure is performed until all initialization regions 
are completed with the initialization (690A). In step (61 OA), the completion of 
the regional initialization Includes completion of updating the Initialization 
progress state to the initialization progress table. In step (62 5A), the state of 
being initialized of the regional progress Initialization comprises updating the 
initialization progress state to the initialization progress table. 
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[Para 57] In step (64 5A), the purpose of writing tlie initialization progress 
table into the member disk in a suitable time is to raise the performance. 
Because the initialization region is in the data space while the initialization 
progress table is in the reserved space, a pickup head of a disk drive has to 
move between the two spaces when initialization process and writing of the 
initialization progress table into the disk drive are performed at the same time. 
This causes a bad performance. Please note that in step (645A) of writing the 
initialization progress table into the member disk, it can simply update the 
changing part of the initialization progress table instead of writing the whole 
initialization progress table into the member disk. 

[Para 58] The reason why the test step (61 OA) is performed is that some 
initialization regions have been initialized because of some reasons and do not 
have to and cannot be initialized again since initialization of these initialization 
regions has been completed and apparently does not have to be done again, 
and moreover, there may be some data generated due to the host I/O stored in 
the initialized regions and would be destroyed, for example, reset to zero, if 
initialization is performed again. One of the possible reasons mentioned above 
why the initialization of some regions have been completed is that when the 
host entity performed I/O operation on a RAID, the associated initialization 
regions which had not been initialized yet started to be initialized, and thus 
some of these initialization regions have been already completed with 
initialization. 

[Para 59] The step (62 5A) is performed because some initialization regions 
may be now being initialized but not completed due to some reasons. These 
initialization regions do not need to be initialized again because they are 
"being" initialized. One of the possible reasons mentioned above why the 
initialization of some regions are being initialized is that when the host entity 
performed I/O operation on a RAID, the associated initialization regions which 
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had been initialized yet started to be initialized, and thus some of these 
initialization regions are being . 

[Para 60] In step (605A) of selecting an initialization region to perform the 
initialization, it can be of a predetermined way to perform the regional 
initialization. For example, the regional initialization can be started from a 
certain initialization region, and sections of the certain initialization on the 
different PSDs are initialized one by one, and then proceeds to another 
initialization region, on the sections thereof. The initialization progress state 
of each region in the initialization progress table is updated. At last, all 
initialization regions are initialized completely and the consistency 
initialization is completed. The order of the regional initialization can be 
determined according to the order of the addresses, such as from a small 
address to a large address, or from a large address to a small address, or a 
predetermined order, or even not according to a specific order, such as in 
random. In other words, any method capable of selecting the initialization 
region that is not completed with the initialization can be used in the present 
invention. 

[Para 61] According to an embodiment of the present invention, the regional 
initialization can define all user/check data space of the member disks to a 
predetermined state, and make the user data consistent with the check data. 
This can prevent from unexpected behaviors of the host resulted from the fact 
that the user data are not predefined. According to another embodiment of the 
present invention, the regional initialization can simply define the user data 
into a predetermined state, especially for the NRAID or RAID level 0 RAID 
(because no check data exists therein). 

[Para 62] Although the member disk may fail when the consistency 
initialization is performed, the data in the failed member disk can be 
regenerated from other member disks. For example, in a RAID level 1,2,3, 4, 



Page 1 6 of 48 



5 RAID, data can be regenerated from the user/check data of other member 
disks. However, in a NI^ID or I^ID level 0 RAID, there is no check data for 
protection of user data. If data rebuilding has to be performed during the 
consistency initialization, only initialization regions on which the initialization 
has been completed will perform the data rebuilding. The other initialization 
regions which have not been completed with the consistency initialization will 
perform the data rebuilding through the initialization. 

[Para 63] Please refer to Fig.6B, which is a diagram of another embodiment of 
a consecutive consistency initialization according to the present invention. 
This embodiment shown in Fig.6B is quite similar to the embodiment shown in 
Fig.6A. The only difference is that after the step (61 5B) when there is a region 
needed to perform regional initialization, the regional initialization priority 
adjusting mechanism is performed (660B) first, and then the step (605 B) is 
performed. 

[Para 64] The strategy in the embodiment shown in Fig.6B is explained as 
follows. If there is no I/O command from the host entity to the RAID controller, 
the initialization is performed in a predetermined way. If there exists an I/O 
command from the host entity to the RAID controller, the regional initialization 
corresponding to the I/O command mapped from the host I/O can be 
performed. Moreover, the priority of the regional initialization induced by the 
I/O command can be raised in the initialization. 

[Para 65] Several embodiments of the priority adjusting mechanism (660B) 
can be seen in Fig.7A and Fig.ZB. In Fig.7A, the priority adjusting mechanism 
detects whether there is a host I/O accessing a RAID which is performing the 
initialization. If there is, the regional initialization is performed after all 
associated host I/O is completed. 



Page 1 7 of 48 



[Para 66] In Fig.7B, the priority adjusting mecPianism (660B) detects wPietlier 
tPiere is a host I/O to access a RAID which is performing the initialization. If 
there is, the associated host I/O and the regional initialization can be 
performed in a predetermined way, for example, two induced consistency 
initialization induced by the host I/O can be performed first, and then one 
conductive consistency initialization is performed. According to one 
embodiment of Fig.7B, the predetermined way can be performing the regional 
initialization and the host I/O in stagger. According to another embodiment of 
Fig.ZB, the predetermined way can be that there is an upper limit to the 
number of concurrent regional initializations , such as 3, and there is another 
upper limit to the number of concurrent regional initialization induced by the 
host I/O ,such as 2, to maintain the performance of the system. 

[Para 67] Please refer to Fig. 8, which is a diagram of an embodiment of an 
induced consistency initialization according to the present invention. When the 
RAID controller receives a host I/O to access the RAID, the RAID controller will 
test whether the consistency initialization is completed (805). If in step (805) it 
is affirmative, which means that the RAID initialization is completed and the 
initialization information is stored in the member disk and the consistency 
initialization is over, the RAID can be accesses according to the host I/O. If in 
step (805) it is negative, the initialization table is checked to determine 
whether the initialization region corresponding to the host I/O is completed 
with initialization (81 0). If step (81 0) is affirmative, and then it is tested to see 
if the initialization progress table has been stored in the member disk (860). If 
step (810) is negative, testing if the associated initialization regions are being 
initialized (81 5). If step (81 5) is affirmative, step (820) of waiting the 
completion of the initialization region is initialized is performed and then step 
(860) of detecting whether the initialization progress table is stored in the 
member disk is performed. If step (81 5) is negative, the initialization is 
performed on the initialization region corresponding to the host I/O (825), and 
then the initialization progress table is updated to the memory (830), and 
further, checking if the initialization progress table is stored in the member 
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disk is performed (860). If step (860) is affirmative, wPiicli means tPiat tPie 
initialization progress table is stored in the member, the RAID is accessed 
according to the host I/O (850). If step (860) is negative, testing whether the 
initialization progress table is being stored to the member disk (865). If step 
(865) is affirmative, the step (875) of waiting until the initialization progress 
table is stored in the member disk is performed. If the step (865) is negative, 
the initialization progress table has to be stored to the member disk (870) 
first, and then the step (875) of waiting until the initialization progress table is 
stored in the member disk is performed. After the step (875) is completed, the 
step (850) is performed. When the host I/O accessing the RAID is completed, 
the RAID controller reports the result of the host I/O to the host entity (885), 
and the host I/O is completed (890). Please note that in step (870), when 
writing the updated initialization progress table into the member disk, we can 
simply update the changed part of the initialization progress table or we can 
write the whole initialization progress table into the member disk. When the 
write policy of the RAID is set to write back mode, it is not necessary to 
perform step (885) to reports the result of the host I/O to the host entity since 
it has been reported when the RAID controller received the host I/O. 

[Para 68] Although in Fig. 8, the updated initialization progress table is written 
into the member disk in step (870), in another embodiment of the present 
invention, however, the updated initialization progress table can be written 
into the member disk immediately after the initialization progress table has 
been updated in the memory (830). This does not change the function of the 
present invention if other steps keep unchanged. Generally speaking, the 
updated initialization progress table has to be updated to the member disk 
before the RAID is accessed due to the I/O operation. 

[Para 69] In Fig. 8, every time when the regional initialization is completed 
(825), the initialization progress table is written into the reserved space once 
(835). Because the addresses of the host I/O and of the reserved space are 
different, the performance is reduced. In order to raise the performance. 
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according to an embodiment of tPie present invention, tPie initialization 
progress table is updated in the memory instead of in the member disk first 
when an initialization region is completed with initialization, and the initialized 
progress table is written into the reserved space of the member disk after the 
associated host I/O is completed. If other regional initializations have been 
completed before such host I/O is completed and the initialization progress 
table is written into the member disks therefore, the updated data in 
initialization progress table due to such host I/O is also written into the 
member disks accordingly and does not have to write again. In general, this 
reduces the writing times of the initialization progress table into the member 
disks, and the performance is raised. This is shown in Fig.9A and Fig.9B. 

[Para 70] Fig.9A and Fig.QB are diagrams of another embodiment of an 
induced consistency initialization according to the present invention. As shown 
in Fig.9A, when the RAID controller receives a host I/O accessing the RAID, it is 
tested to see if the consistency initialization of the RAID is completed (905). If 
step (905) is affirmative, the RAID is accessed according to the I/O operation 
(950). If step (905) is negative, the initialization progress table is checked to 
determine whether the initialization of the associated initialization region is 
completed (91 0). If step (91 0) is affirmative, the RAID is accessed according to 
the I/O operation (950). If step (91 0) is negative, it is necessary to test if the 
associated initialization region is being initialized (91 5). If the step (91 5) is 
affirmative, step (920) of waiting for the completion of the initialization of the 
associated initialization region is performed, and then the RAID is accessed 
according to the I/O operation (950), if the step (91 5) is negative, the 
associated initialization region starts to be initialized (925), and then the 
initialization progress table is updated into the memory after the initialization 
is completed (930), and further, the RAID is accessed according to the I/O 
operation (950). 

[Para 71 ] In Fig. 8 and Fig.9A, in steps (81 0) and (91 0), the completion of the 
regional initialization includes completion of updating the initialization 
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progress state to the initialization progress table. In steps (81 5) and (91 5), the 
state of being initialized of the regional initialization includes updating the 
initialization progress state to the initialization progress table. In step (820) 
and (920), waiting for the initialization region initialized includes waiting for 
updating the initialization progress state to the initialization progress table. 

[Para 72] After the RAID access resulting from the I/O operation is completed 
(950), the initialization progress table will be stored into the member disks. As 
shown in Fig.9B, it is checked to see if the consistency initialization of the RAID 
is completed (955). If step (955) is affirmative, which means that the 
consistency initialization is completed and the initialization progress table 
does not need to be stored in the member disk. The RAID controller reports 
the I/O operation result to the host entity (985). If step (955) is negative, 
testing to determine if the updated initialized progress table is stored in the 
member disk (960). If step (960) is affirmative, which means that the updated 
initialization progress table has been stored in the member disk, so that the 
updated initialization progress table does not have to be stored in the member 
disk again, and the RAID controller reports I/O operation result to the host 
entity (985). If step (960) is negative, checking if the updated initialization 
progress table is being stored into the member disk (965). If step (965) is 
affirmative, step (975) of waiting for the completion of storing the updated 
initialization progress table into the member disk is performed, and then the 
RAID controller reports the I/O operation result to the host entity (985). If step 
(965) is negative, step (970) of storing the updated initialization progress table 
is performed, and then step (975) of waiting for the completion of the storing 
the updated initialization progress table into the member disk is performed, 
and then the RAID controller reports the I/O operation result to the host entity 
(985). After the RAID controller reports the I/O operation result to the host 
entity (985), the I/O operation is completed (990). When the write policy of the 
RAID is set to write back mode, it is not necessary to perform step (985) to 
reports the result of the host I/O to the host entity since it has been reported 
when the RAID controller received the host I/O. 
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[Para 73] When the RAID creation is completed and before the consistency 
Initialization is completed, if the RAID subsystem is restarted when power is 
resumed after cut off or due to some other unexpected reasons, the RAID 
configuration is loaded to the RAID controller from the member disks (1 01 0) 
first (as shown in Fig. 10), and then the initialization progress table is loaded 
(1 020), and further, the RAID originally defined is presented to the host entity 
(1080) and the previous consistency initialization is continued (1050). 

[Para 74] Please refer to Fig.l 1 , which is a diagram of loading an initialization 
progress table when the RAID subsystem is restarted according to the present 
invention. First, the initialization progress tables are loaded from the member 
disks (1 1 10), and the initialization progress tables are checked to see if a 
correct initialization progress table can be identified (1 1 20). If a correct 
initialization progress table is identified, it can be used as the set initialization 
progress table (1 1 30). In a normal situation, if there has ever been an 
initialization progress table stored in a member disk, a correct initialization 
progress table should be identified (1050). Furthermore, if the consistency 
initialization is not completed, the uncompleted consistency initialization can 
be continued according to the correct initialization progress table. On the 
other hand, if no correct initialization progress table is identified, a new 
initialization progress table will be created as the set initialization progress 
table for initialization (1 140). Usually, this would not happen. Were it to 
happen, the RAID is in an unpredictable and abnormal state, and the original 
data stored in the member disks could be lost because a new initialization 
progress table is created and a new regional initialization is started. 

[Para 75] Please note that, a data structure of an initialization progress table 
may comprise the unique ID and the update sequence number of the RAID. In 
addition, the data can be protected by CRC to maintain the correctness of data. 
Moreover, each member disk can store one or more initialization progress 
tables, for example, two initialization progress tables. When an updated 
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initialization progress table is stored in the member disk, only one of the 
initialization progress tables stored therein Is updated. This prevents from 
data errors occurred in all member disks if power is cut off while the 
initialization progress table is being updated. That is, the member disk can 
store several versions of initialization progress table. More details are shown in 
Fig. 12. 

[Para 76] Please refer to Fig.l 2, which is a diagram of an embodiment of 
identifying a correct initialization progress table according to the present 
invention. Here, one member disk can store one or more initialization progress 
tables. The present inventive method checks all initialization progress tables in 
each member disk, stores a latest correct initialization progress table as a 
record or updates the earlier record of the initialization progress tables to 
determine if a correct initialization progress table can be identified after all 
member disks are all checked. 

[Para 77] First, step (1 205) of detecting whether there is a member disk which 
has not been detected yet is performed. If step (1 205) is affirmative, step 
(1 207) of detecting if in that member disk there is an initialization progress 
table which has not been checked yet. Since each member disk has more than 
one version of initialization progress table, it is necessary to check and identify 
the latest and correct initialization progress table. If step (1 207) is affirmative 
which means that there exists an initialization progress table which is not 
checked, and the unique ID of the member disk is checked to see if the disk is 
a member disk of the RAID and the initialization progress table is checked to 
see if the initialization progress table is consistent with the CRC data (1210), 
where the CRC data is generated by calculating the initialization progress table 
and stored in the reserved space. If the disk drive does not belong to the RAID 
or the initialization progress table is not consistent with the CRC data (121 5), 
the initialization progress table can not be used and step (1207) is performed 
again to check if in the same disk drive there is still another initialization 
progress table which has not been checked yet. If the unique ID is correct and 
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the initialization progress table is consistent with the CRC data (1215) which 
mean that the initialization progress table is correct and can be used, then 
checking if there exists a correct record of the initialization progress table 
(1 220). If step (1 220) is negative, the initialization progress table is stored 
(1 225) as a correct record. If step (1 220) is affirmative, which means that a 
correct record of the initialization progress table exists earlier, the sequential 
number of the new initialization progress table is checked to see if it is later 
than that of the record (1 230). If step (1 230) is affirmative, which means that 
the new initialization progress table is latter and newer, the record of the 
initialization progress table is updated (1 235). If step (1 230) is negative, which 
means that the new initialization progress table is older, and therefore the new 
initialization progress table is not used. Here, the sequential number is used to 
store the version of the initialization progress table and, for example, the 
latter the version is, the larger the sequential number is, and it incremented 
with each updated initialization progress table written into the member disks. 
Another example is a time stamp, which can also be used as the sequential 
number. When a correct initialization progress table record is created (1 225) or 
after the correct record of the initialization progress table is updated (1 235), 
step (1 207) of checking if there is an initialization progress table which has 
not been checked is performed again. 

[Para 78] Here, if step (1 207) is negative, which means that all initialization 
progress tables of the member disk are detected, step (1 205) is performed to 
check if there is still another member disk which has not been checked. If step 
(1 205) is negative, which means that all member disks are checked, at this 
time, step (1255) of checking whether a correct record of an initialization 
progress table exists is performed. If step (1255) is affirmative, step (1 260) 
shows that a correct initialization progress table is identified. If step (1 255) is 
negative, in step (1 270), it means that no correct initialization progress table is 
identified. 
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[Para 79] Although in embodiments of Fig.2A through Fig.2C, the consistency 
initialization comprises the consecutive consistency initialization and the 
induced consistency initialization, the consistency initialization can simply 
comprises the induced consistency initialization according to another 
embodiment of the present invention. In other words, only the data space 
corresponding to the host I/O is initialized, other data space is not. In 
addition, although in the above-mentioned embodiments of the present 
invention, the host I/O is often used as the I/O that accesses and induces the 
induced consistency initialization, whether the I/O is a host I/O is not the 
point. The point is, according to the induced consistency initialization of the 
present invention, when a RAID is completed with the RAID CREATE procedure, 
the RAID is allowed to be accessed by I/O and when a region accessed by an 
I/O is not performed regional consistency initialization, it will start regional 
consistency initialization on the associated region. 

[Para 80] Those skilled in the art will readily observe that numerous 
modifications and alterations of the flow may be made while retaining the 
teachings of the invention. Accordingly, the above disclosure should be 
construed as limited only by the metes and bounds of the appended claims. 
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